
    OpenCL Actors - Adding Data Parallelism to Actor-based Programming with CAF

    The actor model of computation was designed for seamless support of concurrency and distribution. However, it remains unspecific about data-parallel program flows, while the available processing power of modern many-core hardware such as graphics processing units (GPUs) or coprocessors increases the relevance of data parallelism for general-purpose computation. In this work, we introduce OpenCL-enabled actors to the C++ Actor Framework (CAF). This offers a high-level interface for accessing any OpenCL device without leaving the actor paradigm. The new type of actor is integrated into the runtime environment of CAF and gives rise to transparent message passing in distributed systems on heterogeneous hardware. Following the actor logic in CAF, OpenCL kernels can be composed while encapsulated in C++ actors and hence operate in a multi-stage fashion on data resident on the GPU. Developers are thus enabled to build complex data-parallel programs from primitives without leaving the actor paradigm or sacrificing performance. Our evaluations on commodity GPUs, an Nvidia TESLA, and an Intel PHI reveal the expected linear scaling behavior when offloading larger workloads. For sub-second duties, the efficiency of offloading was found to differ largely between devices. Moreover, our findings indicate a negligible overhead over programming with the native OpenCL API.
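    The facade described above hides exactly the host-side boilerplate that raw OpenCL requires. For contrast, below is a minimal offload sketch written against the plain OpenCL C API rather than CAF's own interface; the kernel, buffer sizes, and scale factor are illustrative only.

        // Minimal host-side OpenCL offload in the plain C API. This is the
        // boilerplate that an actor facade such as CAF's would encapsulate
        // behind ordinary message passing. Values are illustrative.
        #include <CL/cl.h>
        #include <cstdio>
        #include <vector>

        static const char* kSource = R"CLC(
        __kernel void scale(__global const float* in, __global float* out, float f) {
          size_t i = get_global_id(0);
          out[i] = in[i] * f;
        })CLC";

        int main() {
          cl_platform_id platform;
          cl_device_id device;
          cl_int err;
          clGetPlatformIDs(1, &platform, nullptr);
          clGetDeviceIDs(platform, CL_DEVICE_TYPE_DEFAULT, 1, &device, nullptr);
          cl_context ctx = clCreateContext(nullptr, 1, &device, nullptr, nullptr, &err);
          cl_command_queue q = clCreateCommandQueue(ctx, device, 0, &err);
          cl_program prog = clCreateProgramWithSource(ctx, 1, &kSource, nullptr, &err);
          clBuildProgram(prog, 1, &device, nullptr, nullptr, nullptr);
          cl_kernel k = clCreateKernel(prog, "scale", &err);

          std::vector<float> in_host(1024, 1.0f), out_host(1024);
          size_t bytes = in_host.size() * sizeof(float);
          cl_mem in = clCreateBuffer(ctx, CL_MEM_READ_ONLY | CL_MEM_COPY_HOST_PTR,
                                     bytes, in_host.data(), &err);
          cl_mem out = clCreateBuffer(ctx, CL_MEM_WRITE_ONLY, bytes, nullptr, &err);
          float factor = 2.0f;
          clSetKernelArg(k, 0, sizeof(cl_mem), &in);
          clSetKernelArg(k, 1, sizeof(cl_mem), &out);
          clSetKernelArg(k, 2, sizeof(float), &factor);
          size_t global = in_host.size();
          clEnqueueNDRangeKernel(q, k, 1, nullptr, &global, nullptr, 0, nullptr, nullptr);
          clEnqueueReadBuffer(q, out, CL_TRUE, 0, bytes, out_host.data(), 0, nullptr, nullptr);
          std::printf("out[0] = %f\n", out_host[0]);  // expect 2.0
          // resource release (clReleaseMemObject etc.) omitted for brevity
          return 0;
        }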

    The Seamless Peer and Cloud Evolution Framework

    Evolutionary algorithms are increasingly being applied to problems that are too computationally expensive to run on a single personal computer due to costly fitness function evaluations and/or large numbers of fitness evaluations. Here, we introduce the Seamless Peer And Cloud Evolution (SPACE) framework, which leverages bleeding-edge web technologies to make the computational resources necessary for running large-scale evolutionary experiments available to amateur and professional researchers alike, in a scalable and cost-effective manner, directly from their web browsers. The SPACE framework accomplishes this by distributing fitness evaluations across a heterogeneous pool of cloud compute nodes and peer computers. As a proof of concept, this framework has been attached to the RoboGen open-source platform for the co-evolution of robot bodies and brains, but importantly the framework has been built in a modular fashion such that it can be easily coupled with other evolutionary computation systems.
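    As a rough illustration of the distribution scheme described above (not SPACE's actual browser-based implementation), the C++ sketch below mocks a heterogeneous worker pool in a single process: threads stand in for peer and cloud nodes, each pulling fitness-evaluation tasks from a shared queue. Task and evaluate are hypothetical placeholders for a genome and its fitness function.

        // Single-process mock of master/worker fitness distribution.
        #include <cmath>
        #include <cstdio>
        #include <mutex>
        #include <queue>
        #include <thread>
        #include <vector>

        struct Task { int id; double genome; };   // hypothetical stand-in for a genome

        double evaluate(const Task& t) {          // hypothetical fitness function
          return -std::pow(t.genome - 3.0, 2.0);  // maximum at genome == 3
        }

        int main() {
          const int kTasks = 16, kWorkers = 4;
          std::queue<Task> tasks;
          for (int i = 0; i < kTasks; ++i) tasks.push({i, i * 0.5});
          std::mutex m;
          std::vector<double> fitness(kTasks);

          // Each thread mimics one peer or cloud node pulling work.
          auto worker = [&] {
            for (;;) {
              Task t;
              {
                std::lock_guard<std::mutex> lock(m);
                if (tasks.empty()) return;
                t = tasks.front();
                tasks.pop();
              }
              fitness[t.id] = evaluate(t);  // the expensive step runs outside the lock
            }
          };

          std::vector<std::thread> pool;
          for (int i = 0; i < kWorkers; ++i) pool.emplace_back(worker);
          for (auto& th : pool) th.join();
          for (int i = 0; i < kTasks; ++i)
            std::printf("genome %d -> fitness %.2f\n", i, fitness[i]);
          return 0;
        }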

    Ant-based Neural Topology Search (ANTS) for Optimizing Recurrent Networks

    Hand-crafting effective and efficient structures for recurrent neural networks (RNNs) is a difficult, expensive, and time-consuming process. To address this challenge, we propose a novel neuro-evolution algorithm based on ant colony optimization (ACO), called Ant-based Neural Topology Search (ANTS), for directly optimizing RNN topologies. The procedure selects from multiple modern recurrent cell types such as ∆-RNN, GRU, LSTM, MGU and UGRNN cells, as well as recurrent connections which may span multiple layers and/or steps of time. In order to introduce an inductive bias that encourages the formation of sparser synaptic connectivity patterns, we investigate several variations of the core algorithm. We do so primarily by formulating different functions that drive the underlying pheromone simulation process (which mimic L1 and L2 regularization in standard machine learning) as well as by introducing ant agents with specialized roles (inspired by how real ant colonies operate), i.e., explorer ants that construct the initial feedforward structure and social ants which select nodes from the feedforward connections to subsequently craft recurrent memory structures. We also incorporate communal intelligence, where the best weights are shared across the ant colony for weight initialization, reducing the number of backpropagation epochs required to locally train candidate RNNs and speeding up the neuro-evolution process. Our results demonstrate that the sparser RNNs evolved by ANTS significantly outperform traditional one- and two-layer architectures consisting of modern memory cells, as well as the well-known NEAT algorithm. Furthermore, we improve upon prior state-of-the-art results on the time series dataset utilized in our experiments.
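    Below is a minimal sketch of the ACO core loop that ANTS builds on, not the ANTS implementation itself: ants sample edges in proportion to pheromone, pheromone evaporates each iteration (the regularization-like pressure mentioned above), and the best ant's edges are reinforced. The fitness lambda is a hypothetical stand-in for locally training and scoring a candidate RNN topology.

        // Toy pheromone-guided edge selection with evaporation and reinforcement.
        #include <cstdio>
        #include <random>
        #include <vector>

        int main() {
          const int kEdges = 8, kAnts = 5, kIters = 50, kPicks = 3;
          std::vector<double> pheromone(kEdges, 1.0);
          std::mt19937 rng(42);

          // Hypothetical stand-in for "train the candidate RNN and score it";
          // here fitness simply favors lower-numbered edges so the loop converges.
          auto fitness = [&](const std::vector<int>& picks) {
            double f = 0.0;
            for (int e : picks) f += kEdges - e;
            return f;
          };

          for (int it = 0; it < kIters; ++it) {
            std::vector<int> best;
            double best_f = -1.0;
            for (int a = 0; a < kAnts; ++a) {
              // sample edges with probability proportional to pheromone
              std::discrete_distribution<int> pick(pheromone.begin(), pheromone.end());
              std::vector<int> picks;
              for (int s = 0; s < kPicks; ++s) picks.push_back(pick(rng));
              double f = fitness(picks);
              if (f > best_f) { best_f = f; best = picks; }
            }
            for (double& p : pheromone) p *= 0.9;    // evaporation decays unused edges
            for (int e : best) pheromone[e] += 0.5;  // reinforce the best ant's edges
          }
          for (int e = 0; e < kEdges; ++e)
            std::printf("edge %d pheromone %.2f\n", e, pheromone[e]);
          return 0;
        }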